PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa16g050920.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 699aa    MW: 77440.9 Da    PI: 7.0531
Description HD-ZIP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Csa16g050920.1  genome  CSGP    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  56     6.9e-18  65     120  1          56
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     ++k  +++++q++e+e++F+++++p+ ++re L ++lgL+  q+k+WFqN+R++ k
  Csa16g050920.1  65 KKKYNRHSEYQIQEMEAFFRECPHPDDKQREVLGRQLGLKPIQIKFWFQNKRTQNK 120
                     789999***********************************************998 PP

2    START     162    4.5e-51  219    438  3          206
                     HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEE CS
           START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlev 84 
                     a  a++el+++a+ +ep+W+  +      +n  e+ ++f ++ +     +++ea+r++++v+m+++ +ve l++ +  W++++     +a+t+e+
  Csa16g050920.1 219 AIGAMEELLLMAQGGEPLWSGGVtgtsLVLNLVEYKRTFRTGLGprlsgFRIEASRETALVPMNPTGVVEMLMQAN-LWSTMFVgmvgRAMTHEK 312
                     6789****************99976554566667777775555599999***************************.****************** PP

                     ECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE- CS
           START  85 issg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdl 172
                     + ++      galq m+ae+q lsplvp R+++fvRy++q+g+  w++vdvS+d+  ++      ++++++pSg+li++ + g+skvtwvehv++
  Csa16g050920.1 313 LLPDvtgnfnGALQIMTAEYQELSPLVPtRESYFVRYCKQQGDNLWAVVDVSIDHLFPNIH----MKCRRRPSGCLIQEIPSGYSKVTWVEHVEV 403
                     ********************************************************99975....999*************************** PP

                     -SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 173 kgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                     ++r   ++++++++++g+a++a++wvatl+r ce+
  Csa16g050920.1 404 DDREAnQNIFKHFISTGQAFAANRWVATLERRCER 438
                     ***999***************************96 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           8.98E-18   57     124  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   3.0E-19    59     125  IPR009057    Homeodomain-like
PROSITE profile  PS50071            15.937     62     122  IPR001356    Homeobox domain
SMART            SM00389            4.7E-17    64     126  IPR001356    Homeobox domain
CDD              cd00086            1.89E-16   65     123  No hit       No description
Pfam             PF00046            1.6E-15    65     120  IPR001356    Homeobox domain
PROSITE profile  PS50848            37.688     208    441  IPR002913    START domain
SuperFamily      SSF55961           1.77E-30   209    440  No hit       No description
CDD              cd08875            7.17E-110  212    437  No hit       No description
SMART            SM00234            5.7E-42    217    438  IPR002913    START domain
Pfam             PF01852            6.0E-42    219    438  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  7.5E-6     271    403  IPR023393    START-like domain
SuperFamily      SSF55961           1.04E-12   462    658  No hit       No description
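The Start and End columns of the feature table read as 1-based, inclusive residue positions: the Homeobox hit at 65 to 120 aligns to HMM positions 1 to 56 without gaps, which is 56 residues. A minimal Python sketch under that assumption follows. The `Feature` container and the `span` and `overlaps` helpers are hypothetical names chosen for illustration, not part of PlantTFDB.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One row of the feature table (hypothetical container, not a PlantTFDB type)."""
    database: str
    entry_id: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def span(feature: Feature) -> int:
    """Number of residues covered by an inclusive [start, end] range."""
    return feature.end - feature.start + 1


def overlaps(a: Feature, b: Feature) -> bool:
    """True when two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Coordinates copied from the Pfam and SMART rows of the table.
pfam_homeobox = Feature("Pfam", "PF00046", 65, 120)
pfam_start = Feature("Pfam", "PF01852", 219, 438)
smart_start = Feature("SMART", "SM00234", 217, 438)

print(span(pfam_homeobox))                  # 56
print(span(pfam_start))                     # 220
print(overlaps(pfam_start, smart_start))    # True
print(overlaps(pfam_homeobox, pfam_start))  # False
```

The same overlap test can be used to group hits from different member databases that describe the same region, such as the Pfam and SMART START domain rows.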
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 699 aa
MSESNMVPVG NNGDNDNKEN NVMNNTDGGA GVGAGSGAEE IDSAKTVSDN REEAMGSNEG  60
PPRKKKKYNR HSEYQIQEME AFFRECPHPD DKQREVLGRQ LGLKPIQIKF WFQNKRTQNK  120
NHQEHCENIG LRSLNDKLSS DNQRFREAIN ELTALAESFT SKMVINNPVM SPRPLHRPPT  180
FEFGARSFTG RELYGNGGNL SRGNTGPADA DKPMIIDLAI GAMEELLLMA QGGEPLWSGG  240
VTGTSLVLNL VEYKRTFRTG LGPRLSGFRI EASRETALVP MNPTGVVEML MQANLWSTMF  300
VGMVGRAMTH EKLLPDVTGN FNGALQIMTA EYQELSPLVP TRESYFVRYC KQQGDNLWAV  360
VDVSIDHLFP NIHMKCRRRP SGCLIQEIPS GYSKVTWVEH VEVDDREANQ NIFKHFISTG  420
QAFAANRWVA TLERRCERIA SNSIITTDFQ SVDSADHLAL TDHGKMSILK LAERVVRSFF  480
VGLTNSMGTT FSGVRGKDIR VMKMKNLNDP GRPPGVVLSA STSFWVPVPP TFVFDFLRNE  540
DHRAHWDVLC NGWIVQKISE TVNGRDSRNC TTILKNKSTC QTKKIIIQET FNDPTASFVI  600
YAPVDTTSIE GVLYAGTDPD YVALLPSGFA ILPDGIGDQP GGNGGGSLLT VSLQMLVEEV  660
PLGKLSTSSV KFVEKLFRAT EMRIKDVFPL HIATPSTR*
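The listing above groups residues in blocks of ten, ends each full line with a running position counter, and marks the stop with `*`. A minimal Python sketch of how such a line could be cleaned and sliced by 1-based, inclusive coordinates is given here. The helper names `clean_residues` and `slice_by_position` are hypothetical, and the example uses only the second line of the listing (positions 61 to 120) together with the Homeobox coordinates 65 to 120 from the Signature Domain section.

```python
def clean_residues(block: str) -> str:
    """Keep only letters: drops spaces, position counters, newlines and '*'."""
    return "".join(ch for ch in block if ch.isalpha())


def slice_by_position(residues: str, first_pos: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at sequence position first_pos."""
    return residues[start - first_pos : end - first_pos + 1]


# Second line of the sequence listing; it covers positions 61-120.
line = "PPRKKKKYNR HSEYQIQEME AFFRECPHPD DKQREVLGRQ LGLKPIQIKF WFQNKRTQNK  120"

residues = clean_residues(line)
homeobox = slice_by_position(residues, first_pos=61, start=65, end=120)

print(len(residues))  # 60
print(homeobox)       # KKKYNRHSEYQIQEMEAFFRECPHPDDKQREVLGRQLGLKPIQIKFWFQNKRTQNK
```

The printed fragment matches the Csa16g050920.1 row of the Homeobox alignment, which is a quick consistency check between the domain coordinates and the sequence.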
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010417818.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG3-like
Swissprot  Q9ZV65          0.0      HDG3_ARATH; Homeobox-leucine zipper protein HDG3
TrEMBL     R0HIB7          0.0      R0HIB7_9BRAS; Uncharacterized protein
STRING     AT2G32370.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G32370.1  0.0      homeodomain GLABROUS 3